Communication and Streaming Complexity of Approximate Pattern Matching

نویسنده

  • Tatiana A. Starikovskaya
چکیده

We consider the approximate pattern matching problem. Given a text T of length 2n and a pattern P of length n, the task is to decide for each prefix T [1, j] of T if it ends with a string that is at the edit distance at most k from P . If this is the case, we must output the edit distance and the corresponding edit operations. We first look at the communication complexity of the problem. We show the following: If Alice and Bob both share the pattern and Alice holds the first half of the text and Bob the second half, then the deterministic one-way communication complexity of the problem is Θ(k logn). If Alice holds the first half of the text, Bob the second half of the text, and Charlie the pattern, then there is a deterministic one-way communication protocol that uses O(k √ n logn) bits. We then develop the first sublinear-space streaming algorithm for the problem. There exists a streaming algorithm that solves the problem in O(k8 √ n log6 n) space. The worst-case time complexity of the algorithmO((k2 √ n+k13)·log4 n) per arrival. The algorithm is randomised with error probability at most 1/poly(n). 1998 ACM Subject Classification F.2 Analysis of Algorithms and Problem Complexity

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maximum Matchings in Dynamic Graph Streams and the Simultaneous Communication Model

We study the problem of finding an approximate maximum matching in two closely related computational models, namely, the dynamic graph streaming model and the simultaneous multi-party communication model. In the dynamic graph streaming model, the input graph is revealed as a stream of edge insertions and deletions, and the goal is to design a small space algorithm to approximate the maximum mat...

متن کامل

On the communication and streaming complexity of maximum bipartite matching

Consider the following communication problem. Alice holds a graph GA = (P,Q,EA) and Bob holds a graph GB = (P,Q,EB), where |P | = |Q| = n. Alice is allowed to send Bob a message m that depends only on the graph GA. Bob must then output a matching M ✓ EA [ EB . What is the minimum message size of the message m that Alice sends to Bob that allows Bob to recover a matching of size at least (1 ✏) t...

متن کامل

The streaming k-mismatch problem

We consider the streaming complexity of a fundamental task in approximate pattern matching: the k-mismatch problem. It asks to compute Hamming distances between a pattern of length n and all length-n substrings of a text for which the Hamming distance does not exceed a given threshold k. In our problem formulation, we report not only the Hamming distance but also, on demand, the full mismatch i...

متن کامل

Approximate Hamming Distance in a Stream

We consider the problem of computing a (1+ε)-approximation of the Hamming distance between a pattern of length n and successive substrings of a stream. We first look at the one-way randomised communication complexity of this problem. We show the following: If Alice and Bob both share the pattern and Alice has the first half of the stream and Bob the second half, then there is an O(ε−4 log2 n) b...

متن کامل

Streaming Communication Protocols

We define the Streaming Communication model that combines the main aspects of communication complexity and streaming. We consider two agents that want to compute some function that depends on inputs that are distributed to each agent. The inputs arrive as data streams and each agent has a bounded memory. Agents are allowed to communicate with each other and also update their memory based on the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017